^ PHNLDlOllO 


19 31.01.2002 

ABSTRACT: 


Hashes are short summaries or signatures of data files which can be used to 
identify the file. Hashing multimedia content (audio, video, images) is difficult because the 
hash of original content and processed (e.g. compressed) content may differ significantly. 

The disclosed method generates robust hashes for multimedia content, for 
5 example, audio clips. The audio clip is divided (12) into successive (preferably overlapping) 
firames. For each frame, the frequency spectrum is divided (15) into bands. A robust property 
of each band (e.g. energy) is computed (16) and represented (17) by a respective hash bit. An 
audio clip is thus represented by a concatenation of binary hash words, one for each fi-ame. 
To identify a possibly compressed audio signal, a block of hash words derived therefirom is 
1 0 matched by a computer (20) with a large database (2 1 ). Such matching strategies are also 
disclosed. In an advantageous embodiment, the extraction process also provides information 
(19) as to which of the hash bits are the least reliable. Flipping these bits considerably 
improves the speed and performance of the matching process. 
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Fig. 1. 


